Metadata Driven Address Validation

SubscriptionThis content is available for Talend Academy subscription users. Open implementation accelerators - EN

 

Address verification and cleansing require the use of third-party address cleansing tools. These tools compare the addresses against reference data (usually acquired from postal services), using fuzzy matching or other probabilistic algorithms. Each tool returns a standardized and verified address and a sequence of codes that indicates the quality and accuracy of the results.

 

However, these codes, as well as the quality of the reference data, differ between tools. Additionally, implementing these third-party tools can be quite complicated. The metadata driven address validation framework provides a unified approach and a simple way to implement these tools in the data quality activities.

 

This configurable framework can cleanse an address against the cloud versions of several third-party address cleansing tools. It can retrieve the possible return codes, apply rules based on different countries, and then, based on the verification level, accept, reject, or send the addresses to Talend Data Stewardship to be reviewed by a data steward.

 

This framework integrates Talend code with the Melissa Data and Loqate third-party tools.

 

Prerequisites:

 

Talend Academy training:

  • Data Integration Basics
  • Data Integration Advanced
  • Talend Cloud Essentials
  • Talend Data Stewardship for data stewards or Talend Cloud Data Stewardship

 

Familiarity with the following subjects for all third-party tools:

  • Validation processes: address capture versus address verification versus address cleansing
  • Deliverable address and postally required elements
  • Partially versus fully verified addresses
  • Geographical coverage and address quality of reference data
  • Available reference data add-ons
  • International datasets, dependencies, territories, and other countries
  • Address formatting: available layouts, abbreviations, transliteration, and diacritics
  • Geocoding